Automatic context-sensitive measurement of the acoustic correlates of distinctive features at landmarks

نویسنده

  • Mark Johnson
چکیده

This paper models speech recognition as the estimation of distinctive feature values at articulatory landmarks 8]. Toward this end, we propose modeling each distinctive feature as a table containing phonetic contexts, a list of signal measurements (acoustic correlates) which provide information about the feature in each context, and, for each context, a statistical model for evaluating the feature given the measurements. The model of a distinctive feature may include several sets of acoustic correlates, each indexed by a diierent set of context features. Context features are typically lower-level features of the same segment, e.g. manner features ((con-tinuant, sonorant]) provide context for the identiication of articulator-bound features ((lips, blade]). The acoustic correlates of a feature can be any static or dynamic spectral measurements deened relative to the time of the landmark. The statistical model is a simple N-dimensional Gaussian hypothesis test. A measurement program has been developed to test the usefulness of user-deened acoustic correlates in user-deened phonetic contexts. Measures of voice onset time and formant locus classiication are presented as examples. 1 Recognition of planning units When a word changes tense, case, or number, the phonemes in the word often change in predictable ways. Distinctive feature notation was developed as a compact notation for these empirical rules of phonological alternation. As such, distinctive features are a model of a data representation used by the human brain to plan speech production 2]. In order to be useful for modeling speech production, a set of distinctive features must be as compact as possible, while providing unique representations for all distinguish-able allophones. In English, for example, there are about forty phonemes, and one to two hundred allophones, all of which may be uniquely represented in terms of about twenty binary distinctive features. In this paper, we will use a binary articulatory feature set capable of representing some syllabic and prosodic information, in addition to traditional allophone distinctions ((gure 1). Since there are fewer distinctive features than phonemes, and since they provide a more parsimonious representation of coarticulation, automatic recognition of distinctive features should be more eecient than automatic recognition of

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

مدل‌سازی بازشناسی واجی کلمات فارسی

Abstract of spoken word recognition is proposed. This model is particularly concerned with extraction of cues from the signal leading to a specification of a word in terms of bundles of distinctive features, which are assumed to be the building blocks of words. In the model proposed, auditory input is chunked into a set of successive time slices. It is assumed that the derivation of the underly...

متن کامل

طراحی الگوریتم بازشناسی واجها با به کارگیری همبسته های آکوستیکی مشخصه های واجی

In the present paper, the phonological feature geometry of the Persian phonemes is analyzed in the form of articulate-free and articulate-bound features based on the articulator model of the nonlinear phonology. Then, the reference phonetic pattern of each feature that consists of one or a set of acoustic correlates, characterized by the quantitative or qualitative values in its phonological re...

متن کامل

A Study of the Relationship between Acoustic Features of “bæle” and the Paralinguistic Information

Language users benefit from special phonetic tools in order to communicate linguistic information as well as different emotional aspects and paralinguistic information through daily conversation. Having functions in conveying semantic information to listeners, prosodic features form the essential part of linguistic behavour, manipulating  them potentially can play an important role in transmitt...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

A Landmark-based Model of Speech Perception: History and Recent Developments

This paper traces some of the history of the development of a model for speech perception in which words are assumed to be represented as sequences of bundles of binary distinctive features. In the model, probability estimates for feature values are derived from measurements of acoustic attributes in the vicinity of acoustic “landmarks.” Landmarks are detected based on amplitude changes in vari...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994